ASR for emotional speech: Clarifying the issues and enhancing performance

نویسندگان

Theologos Athanaselis

Stelios Bakamidis

Ioannis Dologlou

Roddy Cowie

Ellen Douglas-Cowie

Cate Cox

چکیده

There are multiple reasons to expect that recognising the verbal content of emotional speech will be a difficult problem, and recognition rates reported in the literature are in fact low. Including information about prosody improves recognition rate for emotions simulated by actors, but its relevance to the freer patterns of spontaneous speech is unproven. This paper shows that recognition rate for spontaneous emotionally coloured speech can be improved by using a language model based on increased representation of emotional utterances. The models are derived by adapting an already existing corpus, the British National Corpus (BNC). An emotional lexicon is used to identify emotionally coloured words, and sentences containing these words are recombined with the BNC to form a corpus with a raised proportion of emotional material. Using a language model based on that technique improves recognition rate by about 20%.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the Relationship between Emotional Intelligence and Directive Speech Acts Preference

Language and emotion are two related systems in use, in that one system (emotions) impacts the performance of the other (language). Both of them share their functionality in communication. Since the nature of foreign language classrooms is ideally interactional, emotional intelligence (EI) gains importance. The aim of this study was to find out whether one's total emotional quotient and its com...

متن کامل

Helpful Statistics in Recognizing Basic Arabic Phonemes

The recognition of continuous speech is one of the main challenges in the building of automatic speech recognition (ASR) systems, especially when it comes to phonetically complex languages such as Arabic. An ASR system seems to be actually in a blocked alley. Nearly all solutions follow the same general model. The previous research focused on enhancing its performance by incorporating supplemen...

متن کامل

Towards Robust Spontaneous Speech Recognition with Emotional Speech Adapted Acoustic Models

Speech signal in addition to the linguistic information contains additional information about the speaker: age, gender, social status, accent (foreign accent, dialects, etc.), emotional state, health etc. Some of these informational channels induce changes of the speech acoustic characteristics. This article presents evaluation of the ASR acoustic models (first trained on neutral, read speech) ...

متن کامل

An Examination of the Impact of Customer Relationship Management on Marketing Performance by Clarifying Mediating Role of Innovation and Marketing Memory

The aim of this study is to survey the effect of customer relationship management on marketing performance with regard to the mediating role of innovation and marketing memory in Insurance authority in Kerman province. Population in this research is managers and staff of Insurance corporates in Kerman province and the sample amounted to 252 that were estimated by relative random way and Cochran...

متن کامل

Improving the performance of MFCC for Persian robust speech recognition

The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

Neural networks : the official journal of the International Neural Network Society

دوره 18 4 شماره

صفحات -

تاریخ انتشار 2005

ASR for emotional speech: Clarifying the issues and enhancing performance

نویسندگان

چکیده

منابع مشابه

On the Relationship between Emotional Intelligence and Directive Speech Acts Preference

Helpful Statistics in Recognizing Basic Arabic Phonemes

Towards Robust Spontaneous Speech Recognition with Emotional Speech Adapted Acoustic Models

An Examination of the Impact of Customer Relationship Management on Marketing Performance by Clarifying Mediating Role of Innovation and Marketing Memory

Improving the performance of MFCC for Persian robust speech recognition

عنوان ژورنال:

اشتراک گذاری